CDS

Accession Number TCMCG075C30314
gbkey CDS
Protein Id XP_017984870.1
Location complement(join(20502489..20502998,20503097..20503148,20503575..20503621,20503784..20503901,20504843..20504934,20505013..20505105,20505593..20505803,20505899..20506222,20506550..20506764,20506875..20507231,20507807..20507815))
Gene LOC18587364
GeneID 18587364
Organism Theobroma cacao

Protein

Length 675aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018129381.1
Definition PREDICTED: integrator complex subunit 9 isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category A
Description Integrator complex subunit
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01001        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
KEGG_ko ko:K13146        [VIEW IN KEGG]
ko:K13420        [VIEW IN KEGG]
EC 2.7.11.1        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04016        [VIEW IN KEGG]
ko04626        [VIEW IN KEGG]
map04016        [VIEW IN KEGG]
map04626        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAAGTTTACATGCCTTCGTAAAGGTGGTGGTTTCCATTTCCCAGCATGTCATATGCTCAATGTATCTGGGTTTAGGATCTTACTTGATTGCCCCTTGGACCTTTCATCTCTTGCTTTTTTTTCTCCTGTTCCAGTGGCTCATGAGGCCCATAAGTCTTTGGATACTGACTCGGTTATTAGGAAAAAGCAGAAGATGGAAAAGGCTCTTGATGCAAATGATTTAGTACATGCAGAGCCTTGGTATAAAACTGTAAAAAGTTTGCACCTATGGGATGCTTCCTTTATTGACGTTGTTTTGATCTCAAGTCCTATGGGCATGCTTGGCTTGCCGTATCTTACTCGGACCAAGGACTTTTCTGCAAAGATATATGTGACTGAAGCAACTGCAAGAATAGGACAGCTTTTGATGGAGGATCTTGTTTCAATGCACATGGAATTCAGGCAATTTTATGGACCAGAGGATTCTTGTTTCCCTCAATGGTTGAGGTGGGAAGAACTTGAAGTTCTTCAATCTGAAATGAAGAAAATAGCTTTAGGCAAAGATTGTGAGGAGCTGGGAGCTTGGATGCCTTTGTACAGTGCAGATGATGTGAAGGATTGCATGAGGAAGGTTCAAACGCTAAAATATGCTGAAGAAGCTTGCTACAATGGAACTTTGATTATAAAAGCATTCAGCTCTGGTTTGGAAATTGGGACCTGCAATTGGACAATAAATGGTCCAAAGAGAAATATAGCTTATATTACGAACTCTATTTTTGTTTCTACACATGCGGCAGATTTTGATTTCGTTGGTCTTCGAGGGAATGATTTGATAATATATTCAGATTTCTTCTCCCTTGGTGCTGCAGAAAATATGGAGAATGATAATACTTACTTTGATCCAGTTGCTTCGTTAAATTTCAGCGATGATGTCAACAATTTGGAAGAGATGTCTGCATCCTTGCTGAAGGATGATGAAAGTACGGAGGAAATGGAGAAACTAGCTTTTATATGTACCTGTGCCCTTGATTCTGTTAGAGGAGGTGGATCAGTTCTTATTCCTATTGATCGGCTTGGAATCATTCTGGGCCTTTTGGAGCAAATGTCAGTTTTGTTGGAGTCTTCATCTGCAAAGGTTCCCATGTACATTATATCTTCTGTAGCAGAAGAATTATTGGCATTTACCAATATAATACCAGAGTGGCTCTGCAAGCAGCGGCAAGAGAAGCTTTTTTCTGGTGAACCATTGTTTGAACATGCCAAGCTCATAAAAGAGAGGAACATTCATGTGTTTCCTGCAGTTCATTCACCTGAATTATTAACCAATTGGCAGGAACCTTGTATCATATTCTCTCCTCACTGGAGTTTGCGGCTCGGTCCAGTTGTTCATTTGCTTCGGCATTGGTGCTCAGATCCAAACTCTTTACTTGTCCTTGAGCCAATGGCAATGAAGGTTCTTCAGTGTTCATTTCTGTCTGGAATGAGGTTGCAGAAAGTTCAACCCTTACTAAAGACATTGCAGCCAAAATTAGTTCTGTTTCCCAAGGATTTGAGGTGCAAGATCCAAATTTTAGAAGCAAACACGATTTTTCTCTACTCTGAAAATGAAACATTACGTATACCAAGCTCAAAGAATAGCACAGAAATAGAGATTGCAACAGATTTGGCTTCCAAGTTCCACTGGAAAACATTGAAGCAGGAAACAATCACGAGGCTGGAGGGAGAGCTTTTCATGGATTATGGCAAACATCGGCTACTTTCTGGGTCCCATCCAGCAGACTCCAAGCAACAAAGACCATTAGTACACTGGGGTTCACCAGATTCGAAAGGGCTTCTGACTGAGCTGTCAAAGATAGGTATTAACGGAACCATAAAACAAGTCAGGGATGATACTGAATCTGAAAGTGCTGCCGGTGTTGTAGAAATCCATGAGCCCAAGAAAGCCGTGATCCATGTGGGAGAAACTGGTACTGTTATCATTTCTGCCGACGAGAATTTAGCCTCCCATATTGTCACAGCTATAGATATTGTTTTGGATGGCATTTAA
Protein:  
MKFTCLRKGGGFHFPACHMLNVSGFRILLDCPLDLSSLAFFSPVPVAHEAHKSLDTDSVIRKKQKMEKALDANDLVHAEPWYKTVKSLHLWDASFIDVVLISSPMGMLGLPYLTRTKDFSAKIYVTEATARIGQLLMEDLVSMHMEFRQFYGPEDSCFPQWLRWEELEVLQSEMKKIALGKDCEELGAWMPLYSADDVKDCMRKVQTLKYAEEACYNGTLIIKAFSSGLEIGTCNWTINGPKRNIAYITNSIFVSTHAADFDFVGLRGNDLIIYSDFFSLGAAENMENDNTYFDPVASLNFSDDVNNLEEMSASLLKDDESTEEMEKLAFICTCALDSVRGGGSVLIPIDRLGIILGLLEQMSVLLESSSAKVPMYIISSVAEELLAFTNIIPEWLCKQRQEKLFSGEPLFEHAKLIKERNIHVFPAVHSPELLTNWQEPCIIFSPHWSLRLGPVVHLLRHWCSDPNSLLVLEPMAMKVLQCSFLSGMRLQKVQPLLKTLQPKLVLFPKDLRCKIQILEANTIFLYSENETLRIPSSKNSTEIEIATDLASKFHWKTLKQETITRLEGELFMDYGKHRLLSGSHPADSKQQRPLVHWGSPDSKGLLTELSKIGINGTIKQVRDDTESESAAGVVEIHEPKKAVIHVGETGTVIISADENLASHIVTAIDIVLDGI